CDS

Accession Number TCMCG011C21111
gbkey CDS
Protein Id XP_021906640.1
Location join(2897627..2897833,2898791..2898922,2899064..2899228,2899321..2899657,2900129..2900210,2900282..2900390,2900501..2900575)
Gene LOC110821202
GeneID 110821202
Organism Carica papaya
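
The Location field lists seven coordinate ranges joined into one coding sequence. As a minimal sketch, the ranges can be parsed and summed to check the record's internal consistency. It assumes the coordinates are 1-based and inclusive (the usual feature-table convention), that the feature lies on the forward strand with no partial markers, and that the last range includes the stop codon. The names `parse_join` and `segment_lengths` are illustrative helpers, not part of any database API.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "join(2897627..2897833,2898791..2898922,2899064..2899228,"
    "2899321..2899657,2900129..2900210,2900282..2900390,2900501..2900575)"
)


def parse_join(location):
    """Return a list of (start, end) pairs, assumed 1-based and inclusive."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def segment_lengths(segments):
    """Length of each segment in nucleotides (both endpoints counted)."""
    return [end - start + 1 for start, end in segments]


segments = parse_join(LOCATION)
lengths = segment_lengths(segments)
cds_nt = sum(lengths)

print("segments:", len(segments))
print("segment lengths:", lengths)
print("CDS length (nt):", cds_nt)
# Assumption: the final codon is a stop codon, so it is not counted as a residue.
print("implied protein length (aa):", cds_nt // 3 - 1)
```

Under these assumptions the segments total 1107 nt, that is 369 codons, or 368 residues plus a stop codon, which agrees with the Length field in the Protein section.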

Protein

Length 368 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA264084
db_source XM_022050948.1
Definition probable polygalacturonase At1g80170 isoform X1 [Carica papaya]

EGGNOG-MAPPER Annotation

COG_category G
Description Belongs to the glycosyl hydrolase 28 family
KEGG_TC -
KEGG_Module M00081
KEGG_Reaction R01982, R07413
KEGG_rclass RC00049
BRITE ko00000, ko00001, ko00002, ko01000
KEGG_ko ko:K01213
EC 3.2.1.67
KEGG_Pathway ko00040, ko01100, map00040, map01100
GOs GO:0005575, GO:0005618, GO:0005623, GO:0009505, GO:0030312, GO:0044464, GO:0071944

Sequence

CDS:  
ATGAAGAACTTGGGATTTTGTTTATCTTCTAAATCTCTTCTTATTTTTCTTTATCTATATGTATTTCTCTCCTCAAAAATTGAAGGATTTGAGACTCTGCTACAACTCCCACGTTCTGGGTCCTCCGAAATCCGACCCAGATCCTCACGGGTCATCACCATTGCTGATTTTGGTGCCAAAGGAGATGGTCTTACTGATGATACTCAGGCTTTTGAAAATGCTTGGAAGATGGCTTGTTCTTGCCCTGCTCGGTCAATTCTCCTAGTACCAGCTGGATTTACTTCGCTAGTCCATCCTATTGATTTTGCTGGACCATGTAGGTCAAAAGTGACTCTGCAGATATCTGGTGCTATTGTTTCTCCGAAAAATCCTGAATCCTGGACTGGCTTAAACCCTCGGAAGTGGCTCTACTTCCATGGGGTGAACCATCTTCAAGTGGAAGGGGGAGGGACTATCAACGGAAGGGGACAAAAATGGTGGGACTGCAAAGTCAATAAAACAAGTCCTTATCATCGTGCCCCAACAGTAAGAGCCTCATTTTGTTGTTCCATTTGTCTTTTTCAATATTGGATTCTTCTTTCCCTTTCTAATGGTATTGCCATGGGCTTCATATTGTTAACAATTATACTGCAGGCCATGACATTCCACAGGTGCAAGAATTTGAAAGTGCATAACCTTAAATTGATAAATAGTCAGCAAATGCACATAGCATTCACTACCTGCAATCAGGTTAAGGCATCCCATCTTGAAGTTATAGCACCTGCTTCCAGCCCCAATACGGATGGAATCCACATTAGTAATTCTCATAATGTTAAGATCAAAAATAGCATCGTCAGAACAGGAGACGACTGCATCTCAATAGTCAGCAATTCCTCAAAGATCCAAATCAGAAACATTTTCTGTGGGCCAGGCCACGGCATAAGCATTGGGAGCCTAGGGGAATCGGGCTCATGGGTACAGGTGCATGATGTAAAGGTTGATGGAGCATTCCTGTTCAACACTGATAATGGATTGCGGATTAAAACATGGCAGGGAGGTAATGGTTTTGCTTCTGATATCAAATTCCAGAATATTTTGATGGAGAATGTATTAAACCCAATTATATAG
Protein:  
MKNLGFCLSSKSLLIFLYLYVFLSSKIEGFETLLQLPRSGSSEIRPRSSRVITIADFGAKGDGLTDDTQAFENAWKMACSCPARSILLVPAGFTSLVHPIDFAGPCRSKVTLQISGAIVSPKNPESWTGLNPRKWLYFHGVNHLQVEGGGTINGRGQKWWDCKVNKTSPYHRAPTVRASFCCSICLFQYWILLSLSNGIAMGFILLTIILQAMTFHRCKNLKVHNLKLINSQQMHIAFTTCNQVKASHLEVIAPASSPNTDGIHISNSHNVKIKNSIVRTGDDCISIVSNSSKIQIRNIFCGPGHGISIGSLGESGSWVQVHDVKVDGAFLFNTDNGLRIKTWQGGNGFASDIKFQNILMENVLNPII
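
The Protein sequence is the conceptual translation of the CDS. The sketch below shows one way to reproduce that translation, assuming the standard genetic code (NCBI translation table 1) and a reading frame that starts at the first nucleotide. To keep it short, it translates only the first 21 and last 15 nucleotides of the CDS above; the full string can be passed to the same function. The `translate` helper is illustrative and not taken from any particular library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_at_stop=True):
    """Translate a DNA coding sequence in frame 0.

    Unknown codons (e.g. containing N) become 'X'. A trailing partial
    codon is ignored. Translation ends at the first stop codon when
    stop_at_stop is True.
    """
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        residues.append(aa)
    return "".join(residues)


# First 21 and last 15 nucleotides of the CDS listed in this record.
cds_start = "ATGAAGAACTTGGGATTTTGT"
cds_end = "AACCCAATTATATAG"

print(translate(cds_start))  # compare with the start of the Protein sequence
print(translate(cds_end))    # compare with the end of the Protein sequence
```

The two fragments translate to `MKNLGFC` and `NPII`, matching the first seven and last four residues of the Protein sequence; the final codon `TAG` is a stop codon and is not emitted.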